06 Network Graphs

Visualizing and working with network graphs is a common problem in many different disciplines. HoloViews provides the ability to represent and visualize graphs very simply and easily with facilities for interactively exploring the nodes and edges of the graph, especially using the Bokeh plotting interface. It can also make use of Datashader for plotting large graphs, and NetworkX for some convenient graph functions:

In [1]:
import numpy as np
import pandas as pd
import holoviews as hv
import networkx as nx

hv.extension('bokeh')

%opts Graph [width=400 height=400]

The HoloViews Graph Element differs from other elements in HoloViews in that it consists of multiple sub-elements. The Graph element itself holds the data that indicates whether each node is connected to each other node. By default the element will automatically compute concrete x and y positions for the nodes and represent them using a Nodes element, which is stored on the Graph. The abstract edges and concrete node positions are sufficient to render the Graph by drawing straight-line edges between the nodes. In order to supply explicit edge paths we can also declare EdgePaths , providing explicit coordinates for each edge to follow.

To summarize, a Graph consists of three different components:

  • The Graph itself holds the abstract edges stored as a table of node index pairs.
  • The Nodes hold the concrete x and y positions of each node along with a node index . The Nodes may also define any number of value dimensions, which can be revealed when hovering over the nodes or to color the nodes by.
  • The EdgePaths can optionally be supplied to declare explicit node paths.

A simple Graph

Let's start by declaring a very simple graph connecting one node to all others. If we simply supply the abstract connectivity of the Graph , it will automatically compute a layout for the nodes using the layout_nodes operation, which defaults to a circular layout:

In [2]:
# Declare abstract edges
N = 8
node_indices = np.arange(N)
source = np.zeros(N)
target = node_indices

padding = dict(x=(-1.2, 1.2), y=(-1.2, 1.2))

simple_graph = hv.Graph(((source, target),)).redim.range(**padding)
simple_graph
Out[2]:

Accessing the nodes and edges

We can easily access the Nodes and EdgePaths on the Graph element using the corresponding properties:

In [3]:
simple_graph.nodes + simple_graph.edgepaths
Out[3]:

Supplying explicit paths

Next we will extend this example by supplying explicit edges:

In [4]:
def bezier(start, end, control, steps=np.linspace(0, 1, 100)):
    return (1-steps)**2*start + 2*(1-steps)*steps*control+steps**2*end

x, y = simple_graph.nodes.array([0, 1]).T

paths = []
for node_index in node_indices:
    ex, ey = x[node_index], y[node_index]
    paths.append(np.column_stack([bezier(x[0], ex, 0), bezier(y[0], ey, 0)]))
    
bezier_graph = hv.Graph(((source, target), (x, y, node_indices), paths)).redim.range(**padding)
bezier_graph
Out[4]:

Interactive features

Hover and selection policies

Thanks to Bokeh we can reveal more about the graph by hovering over the nodes and edges. The Graph element provides an inspection_policy and a selection_policy , which define whether hovering and selection highlight edges associated with the selected node or nodes associated with the selected edge. These policies can be toggled by setting the policy to 'nodes' (the default) or 'edges' .

In [5]:
bezier_graph.options(inspection_policy='edges')
Out[5]:

In addition to changing the policy, we can also change the colors used when hovering and selecting nodes:

In [6]:
%%opts Graph [tools=['hover', 'box_select']] (edge_hover_line_color='green' node_hover_fill_color='red')
bezier_graph.opts(plot=dict(inspection_policy='nodes'))
Out[6]:

Additional information

We can also associate additional information with the nodes and edges of a graph. By constructing the Nodes explicitly we can declare additional value dimensions, which are revealed when hovering and/or can be mapped to the color by specifying the color_index . Similarly, we can associate additional information with each edge by supplying a value dimension to the Graph itself.

In [7]:
%%opts Graph [color_index='Type'] (cmap='Set1')
node_labels = ['Output']+['Input']*(N-1)
edge_labels = list('ABCDEFGH')

nodes = hv.Nodes((x, y, node_indices, node_labels), vdims='Type')
graph = hv.Graph(((source, target, edge_labels), nodes, paths), vdims='Label').redim.range(**padding)
graph + graph.opts(plot=dict(inspection_policy='edges'))
Out[7]:

If you want to supply additional node information without speciying explicit node positions you may pass in a Dataset object consisting only of various value dimensions.

In [8]:
%%opts Graph [color_index='Label'] (cmap='Set1')
node_info = hv.Dataset(node_labels, vdims='Label')
hv.Graph(((source, target), node_info)).redim.range(**padding)
Out[8]:

Working with NetworkX

NetworkX is a very useful library when working with network graphs, and the Graph Element provides ways of importing a NetworkX Graph directly. Here we will load the Karate Club graph and use the circular_layout function provided by NetworkX to lay it out:

In [9]:
%%opts Graph [tools=['hover']]
G = nx.karate_club_graph()
hv.Graph.from_networkx(G, nx.layout.circular_layout).redim.range(**padding)
Out[9]:

Animating graphs

Like all other elements Graph can be updated in a HoloMap or DynamicMap . Here we animate how the Fruchterman-Reingold force-directed algorithm lays out the nodes in real time.

In [10]:
%%opts Graph
G = nx.karate_club_graph()

def get_graph(iteration):
    np.random.seed(10)
    return hv.Graph.from_networkx(G, nx.spring_layout, iterations=iteration)

hv.HoloMap({i: get_graph(i) for i in range(5, 30, 5)},
           kdims='Iterations').redim.range(x=(-1.2, 1.2), y=(-1.2, 1.2))
Out[10]:

Real world graphs

As a final example let's look at a slightly larger graph. We will load a dataset of a Facebook network consisting a number of friendship groups identified by their 'circle' . We will load the edge and node data using pandas and then color each node by their friendship group using many of the things we learned above.

In [11]:
%opts Nodes Graph [width=800 height=800 xaxis=None yaxis=None]
In [12]:
%%opts Graph [color_index='circle']
%%opts Graph (node_size=10 edge_line_width=1)
colors = ['#000000']+hv.Cycle('Category20').values
edges_df = pd.read_csv('../data/fb_edges.csv')
fb_nodes = hv.Nodes(pd.read_csv('../data/fb_nodes.csv')).sort()
fb_graph = hv.Graph((edges_df, fb_nodes), label='Facebook Circles')
fb_graph = fb_graph.redim.range(x=(-0.05, 1.05), y=(-0.05, 1.05)).opts(style=dict(cmap=colors))
fb_graph
Out[12]:

Bundling graphs

The datashader library provides algorithms for bundling the edges of a graph and HoloViews provides convenient wrappers around the libraries.

In [13]:
from holoviews.operation.datashader import datashade, bundle_graph
bundled = bundle_graph(fb_graph)
bundled
Out[13]:

Datashading graphs

For graphs with a large number of edges we can datashade the paths and display the nodes separately. This loses some of the interactive features but will let you visualize quite large graphs. If the number of edges is much greater than the number of nodes, using datashader to render the edges still lets you interact with each node for hovering, even though the connections are now drawn as an image:

In [14]:
%%opts Nodes [color_index='circle'] (size=10 cmap=colors) Overlay [show_legend=False]
datashade(bundled, normalization='linear', width=800, height=800) * bundled.nodes
Out[14]:

Applying selections

Alternatively we can select the nodes and edges by an attribute that resides on either. In this case we will select the nodes and edges for a particular circle and then overlay just the selected part of the graph on the datashaded plot. Note that selections on the Graph itself will select all nodes that connect to one of the selected nodes. In this way a smaller subgraph can be highlighted and the larger graph can be datashaded to reduce the file size.

In [15]:
%%opts Graph (node_fill_color='white')
datashade(bundle_graph(fb_graph), normalization='linear', width=800, height=800) *\
bundled.select(circle='circle15')
Out[15]:

To select just the nodes that are in 'circle15' set the selection_mode='nodes' overriding the default of 'edges':

In [16]:
bundled.select(circle='circle15', selection_mode='nodes')
Out[16]:

Right click to download this notebook from GitHub.